Regarding Scenes

نویسنده

  • John M. Henderson
چکیده

When we view the visual world, our eyes flit from one location to another about three times each second. These frequent changes in gaze direction result from very fast saccadic eye movements. Useful visual information is acquired only during fixations, periods of relative gaze stability. Gaze control is defined as the process of directing fixation through a scene in real time in the service of ongoing perceptual, cognitive, and behavioral activity. This article discusses current approaches and new empirical findings that are allowing investigators to unravel how human gaze control operates during active real-world scene perception. KEYWORDS—scene perception; real-world scene; visual saliency; eye movements; gaze control; visual context It has been known for at least 30 years that the gist of a scene can be apprehended very rapidly, well within the duration of a single fixation or period of relative gaze stability (Potter, 1976). It has been known for even longer that viewers tend to move their eyes through scenes when they look at them (Buswell, 1935; Yarbus, 1967; see Fig. 1). Given fast gist understanding, why do we bother to move our eyes? Recent studies of change detection, object identification, and scene memory show that close or direct fixation is necessary to perceive local visual details, to unambiguously identify objects, and to encode object and scene information into shortand long-term memory (Henderson & Hollingworth, 1999; Hollingworth & Henderson, 2002). What we see and understand about the visual world is tightly tied to where our eyes are pointed. Why is fixation critical to these perceptual and cognitive processes? First, high-resolution visual information is acquired from only a very limited region of the scene surrounding the fixation point, with visual quality falling off precipitously and continuously from central vision into a low-resolution visual surround. The high resolving power of central vision is partly a consequence of the optical and anatomical structure of the eye and retina. Also, ‘‘cortical magnification’’ preferentially maps central vision onto the visual cortex, ensuring that more computational power is devoted to fixated regions. Therefore, to acquire high-quality visual input from a scene region, fixation must be directed to it. Second, there is a very tight link between attention and fixation. Although visual-spatial attention can be dissociated from where the eyes are fixated in laboratory demonstrations, attention is typically directed to the fixated location and the location to be fixated next. Attention is time-locked to eye-movement dynamics as eye-movement control circuits hold fixation and then release the eyes to the next fixation site. This time-locked relationship between shifts in attention and gaze appears to be mandatory, in part due to the tight neural integration of systems that control covert attention and those that control eye movements. Given the importance of fixation for perceptual and cognitive processing during scene perception, a critical issue concerns the representations and processes that govern where and for how long the eyes are directed to a particular scene region. This issue has become the focus of intense investigation in the past few years, with recent research emphasizing two general classes of factors that may drive gaze: bottom-up image properties, and cognitive knowledge structures used in a top-down manner.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Translation and Hybridity in Scenes and Frames Semantics

 The present study is a theoretical attempt to illustrate how Fillmore's Scenes and Frames Semantics (SFS) could be employed as a framework to portray the process of understanding and translating hybrid texts. It first reviews the origin of SFS; then it maps SFS onto Nida’s linguistic model of translation process and the Interpretive Theory of Translation; it examines in the next section, withi...

متن کامل

Technicolor/INRIA Team at the MediaEval 2013 Violent Scenes Detection Task

This paper presents the work done at Technicolor and INRIA regarding the MediaEval 2013 Violent Scenes Detection task, which aims at detecting violent scenes in movies. We participated in both the objective and the subjective subtasks.

متن کامل

Development of Prototype Autostereoscopic Imaging Systems and Applications

In this thesis, prototype 3D imaging systems are developed regarding the capturing and reproduction of 3D scenes based on Integral Photography (IP) technique, for both synthetic and real 3D scenes. Regarding the capturing of synthetic 3D scenes, an IP generator is designed based on computer simulation, by exact modeling of all the necessary optical components of a single stage IP capturing syst...

متن کامل

تأثیر مشاهدۀ سیمای طبیعت و سیما با آوای طبیعت بر اضطراب مرحلۀ اول زایمان مادران نخست‌زا

Introduction: Although delivery is a natural and physiological process, it creates a great deal of anxiety in mothers. This study aimed to assess the effect of viewing scenes of nature and nature scenes along with nature sounds on the anxiety during the active phase in primiparous women. Methods: In this clinical trial, 90 primiparous women in active phase of labor were selected through conven...

متن کامل

Online multiple people tracking-by-detection in crowded scenes

Multiple people detection and tracking is a challenging task in real-world crowded scenes. In this paper, we have presented an online multiple people tracking-by-detection approach with a single camera. We have detected objects with deformable part models and a visual background extractor. In the tracking phase we have used a combination of support vector machine (SVM) person-specific classifie...

متن کامل

On the Use of ROMOT—A RObotized 3D-MOvie Theatre—To Enhance Romantic Movie Scenes

In this paper, we introduce the use of ROMOT—a RObotic 3D-MOvie Theatre—to enhance love and sex movie scenes. ROMOT represents the next generation of movie theatres, where scenes are enhanced with multimodal content, also allowing audience interaction. ROMOT is highly versatile as it can support different setups, integrated hardware and content and, thus, it can be easily adapted to different g...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007